Policy Gradients Beyond Expectations: Conditional Value-at-Risk
نویسندگان
چکیده
Conditional Value at Risk (CVaR) is a prominent risk measure that is being used extensively in various domains such as finance. In this work we present a new formula for the gradient of the CVaR in the form of a conditional expectation. Our result is similar to policy gradients in the reinforcement learning literature. Based on this formula, we propose novel sampling-based estimators for the CVaR gradient, and a corresponding gradient descent procedure for CVaR optimization. We evaluate our approach in learning a risk-sensitive controller for the game of Tetris, and propose an importance sampling procedure that is suitable for such domains.
منابع مشابه
Presenting a model for Multiple-step-ahead-Forecasting of volatility and Conditional Value at Risk in fossil energy markets
Fossil energy markets have always been known as strategic and important markets. They have a significant impact on the macro economy and financial markets of the world. The nature of these markets are accompanied by sudden shocks and volatility in the prices. Therefore, they must be controlled and forecasted by using appropriate tools. This paper adopts the Generalized Auto Regressive Condition...
متن کاملAsymptotic Analysis of Multivariate Tail Conditional Expectations
Tail conditional expectations refer to the expected values of random variables conditioning on some tail events and are closely related to various coherent risk measures. In the univariate case, the tail conditional expectation is asymptotically proportional to the value-at-risk, a popular risk measure. The focus of this paper is on asymptotic relations between the multivariate tail conditional...
متن کاملPolicy Gradients for CVaR-Constrained MDPs
We study a risk-constrained version of the stochastic shortest path (SSP) problem, where the risk measure considered is Conditional Value-at-Risk (CVaR). We propose two algorithms that obtain a locally risk-optimal policy by employing four tools: stochastic approximation, mini batches, policy gradients and importance sampling. Both the algorithms incorporate a CVaR estimation procedure, along t...
متن کاملSaddlepoint Methods for Conditional Expectations with Applications to Risk Management
The paper derives saddlepoint expansions for conditional expectations in the form of E[X |Y = a] and E[X|Y ≥ a] for the sample mean of a continuous random vector (X,Y) whose joint moment generating function is available. Theses conditional expectations frequently appear in various applications, particularly in quantitative finance and risk management. Using the newly developed saddlepoint expan...
متن کاملThree steps method for portfolio optimization by using Conditional Value at Risk measure
Comprehensive methods must be used for portfolio optimization. For this purpose, financial data of stock companies, inputs and outputs variable, the risk measure and investor’s preferences must be considered. By considering these items, we propose a method for portfolio optimization. In this paper, we used financial data of companies for screening the stock companies. We used Conditional Value ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1404.3862 شماره
صفحات -
تاریخ انتشار 2014